1646 



PAGE: 1 RAW SEQUENCE LISTING DATE: 1 1/06/98 ^ ^ 

PATENT APPLICATION VSmm9,177 TIME: 17:12:47 



INPUT SET: S29620.mw 



This Raw Listing contains the General 
Information Section and up to the first 




1 SEQUENCE LISTING 

2 

3 ( 1 ) General Information: 
4 

5 (i) APPLICANT: Kohei MIYAZONO; Takeshe IMAMURA; Peter DEN DUKE 
6 

7 (ii) TITLE OF INVENTION: ISOLATED ALK-1 PROTEIN, NUCLEIC ACIDS ENCODING 

8 IT, AND USES THEREOF 
9 

10 (iii) NUMBER OF SEQUENCES: 29 
11 

12 (iv) CORRESPONDENCE ADDRESS: 

13 (A) ADDRESSEE: Fulbright & Jaworski L.L.P. 

14 (B) STREET: 805 Third Avenue 

15 (C) CITY: New York City 

16 (D) STATE: New York 

17 (E) COUNTRY: USA 

18 (F) ZIP: 10022 
19 

20 (V) COMPUTER READABLE FORM: 

21 (A) MEDIUM TYPE: Diskette, 3.25 inch, 1.44mb 

22 (B) COMPUTER: IBM PS/2 

2 3 (C) OPERATING SYSTEM: PC-DOS 
24 (D) SOFTWARE: Wordperfect 

25 

26 (vi) CURRENT APPLICATION DATA: 

27 (A) APPLICATION NUMBER: 09/039,177 

28 (B) FILING DATE: March 13, 1998 

29 (C) CLASSIFICATION: 435 
30 

31 (vii) PRIOR APPLICATION DATA: 

32 (A) APPLICATION NUMBER: PCT/GB93/0236 

33 (B) FILING DATE: November 17, 1993 
34 

35 (vii) PRIOR APPLICATION DATA: 

36 (A) APPLICATION NUMBER: GB 9224057.1 

37 (B) FILING DATE: November 17, 1992 
38 

3 9 (vii) PRIOR APPLICATION DATA: 

40 (A) APPLICATION NUMBER: GB 9304677.9 

41 (B) FILING DATE: March 8, 1993 
42 

4 3 (vii) PRIOR APPLICATION DATA: 

44 (A) APPLICATION NUMBER: GB 9304680.3 

45 (B) FILING DATE: March 8, 1993 
46 
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47 (Vii) PRIOR APPLICATION DATA: 

48 (A) APPLICATION NUMBER: 9311047.6 

49 (B) FILING DATE: May 28, 1993 
50 

51 (vii) PRIOR APPLICATION DATA: 

52 (A) APPLICATION NUMBER: 9313763 • 6 

53 (B) FILING DATE: July 2, 1993 
54 

55 (vii) PRIOR APPLICATION DATA: 

56 (A) APPLICATION NUMBER: 9136099.2 

57 (B) FILING DATE: August 3, 1993 
58 

5 9 (vii) PRIOR APPLICATION DATA: 

60 (A) APPLICATION NUMBER: 321344.5 

61 (B) FILING DATE: October 15, 1993 
62 

63 (viii) ATTORNEY/AGENT INFORMATION: 

64 (A) NAME: Norman D. Hanson 

65 (B) REGISTRATION NUMBER: 30,946 

66 (C) REFERENCE/ DOCKET NUMBER: LUD 55 39 - JEL/NDH 
67 

68 (ix) TELECOMMUNICATION INFORMATION: 

69 (A) TELEPHONE: (212) 688-9200 

70 (B) TELEFAX: (212) 838-3884 
71 

72 (2) INFORMATION FOR SEQ ID NO: 1: 

73 (i) SEQUENCE CHARACTERISTICS: 

74 (A) LENGTH: 1984 base pairs 

75 (B) TYPE: nucleic acid 

76 (C) STRANDEDNESS: unknown 

77 (D) TOPOLOGY: linear 
78 

7 9 (ii) MOLECULE TYPE: cDNA 

80 (iii) HYPOTHETICAL: NO 

81 (iii) ANTI-SENSE: NO 

82 (V) FRAGMENT TYPE: internal 

83 (vi) ORIGINAL SOURCE: 

84 (A) ORGANISM: Homo sapiens 
85 (ix) FEATURE: 

86 (A) NAME/KEY: CDS 

87 (B) LOCATION: 283.. 1791 
88 

89 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
90 

91 AGGAAACGGT TTATTAGGAG GGAGTGGTGG AGCTGGGCCA GGCAGGAAGA CGCTGGAATA 6 0 
92 

93 AGAAACATTT TTGCTCCAGC CCCCATCCCA GTCCCGGGAG GCTGCCGCGC CAGCTGCGCC 120 
94 

95 GAGCGAGCCC CTCCCCGGCT CCAGCCCGGT CCGGGGCCGC GCCGGACCCC AGCCCGCCGT 180 
96 

97 CCAGCGCTGG CGGTGCAACT GCGGCCGCGC GGTGGAGGGG AGGTGGCCCC GGTCCGCCGA 240 
98 

99 AGGCTAGCGC CCCGCCACCC GCAGAGCGGG CCCAGAGGGA CC ATG ACC TTG GGC 2 94 
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100 Met Thr Leu Gly 

101 1 
102 

10 3 TCC CCC AGG AAA GGC CTT CTG ATG CTG CTG ATG GCC TTG GTG ACC CAG 342 

104 Ser Pro Arg Lys Gly Leu Leu Met Leu Leu Met Ala Leu Val Thr Gin 

105 5 10 15 20 
106 

107 GGA GAG COT GTG AAG CCG TCT CGG GGC CCG CTG GTG ACC TGC ACG TGT 390 

108 Gly Asp Pro Val Lys Pro Ser Arg Gly Pro Leu Val Thr Cys Thr Cys 

109 25 30 35 
110 

111 GAG AGC CCA CAT TGC AAG GGG CCT ACC TGC CGG GGG GCC TGG TGC ACA 4 38 

112 Glu Ser Pro His Cys Lys Gly Pro Thr Cys Arg Gly Ala Trp Cys Thr 

113 40 45 50 
114 

115 GTA GTG CTG GTG CGG GAG GAG GGG AGG CAC CCC CAG GAA CAT CGG GGC 486 

116 Val Val Leu Val Arg Glu Glu Gly Arg His Pro Gin Glu His Arg Gly 

117 55 60 65 
118 

119 TGC GGG AAC TTG CAC AGG GAG CTC TGC AGG GGG CGC CCC ACC GAG TTC 5 34 

120 Cys Gly Asn Leu His Arg Glu Leu Cys Arg Gly Arg Pro Thr Glu Phe 

121 70 75 80 
122 

123 GTC AAC CAC TAC TGC TGC GAC AGC CAC CTC TGC AAC CAC AAC GTG TCC 582 

124 Val Asn His Tyr Cys Cys Asp Ser His Leu Cys Asn His Asn Val Ser 

125 85 90 95 100 
126 

127 CTG GTG CTG GAG GCC ACC CAA CCT CCT TCG GAG CAG CCG GGA ACA GAT 6 30 

128 Leu Val Leu Glu Ala Thr Gin Pro Pro Ser Glu Gin Pro Gly Thr Asp 

129 105 110 115 
130 

131 GGC CAG CTG GCC CTG ATC CTG GGC CCC GTG CTG GCC TTG CTG GCC CTG 678 

132 Gly Gin Leu Ala Leu lie Leu Gly Pro Val Leu Ala Leu Leu Ala Leu 

133 120 125 130 
134 

135 GTG GCC CTG GGT GTC CTG GGC CTG TGG CAT GTC CGA CGG AGG CAG GAG 726 

136 Val Ala Leu Gly Val Leu Gly Leu Trp His Val Arg Arg Arg Gin Glu 

137 135 140 145 
138 

139 AAG CAG CGT GGC CTG CAC AGC GAG CTG GGA GAG TCC ACT CTC ATC CTG 774 

140 Lys Gin Arg Gly Leu His Ser Glu Leu Gly Glu Ser Ser Leu lie Leu 

141 150 155 160 
142 

14 3 AAA GCA TCT GAG CAG GGC GAC ACG ATG TTG GGG GAC CTC CTG GAC AGT 822 

144 Lys Ala Ser Glu Gin Gly Asp Thr Met Leu Gly Asp Leu Leu Asp Ser 

145 165 170 175 180 
146 

147 GAC TGC ACC ACA GGG AGT GGC TCA GGG CTC CCC TTC CTG GTG CAG AGG 870 

148 Asp Cys Thr Thr Gly Ser Gly Ser Gly Leu Pro Phe Leu Val Gin Arg 

149 185 190 195 
150 

151 ACA GTG GCA CGG CAG GTT GCC TTG GTGGAG TGT GTG GGA AAA GGC CGC 918 

152 Thr Val Ala Arg Gin Val Ala Leu Val Glu Cys Val Gly Lys Gly Arg 
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153 200 205 210 

154 

155 TAT GGC GAA GTG TGG CGG GGC TTG TGG CAC GGT GAG AGT GTG GCC GTC 966 

156 Tyr Gly Glu Val Trp Arg Gly Leu Trp His Gly Glu Ser Val Ala Val 

157 215 220 225 
158 

15 9 AAG ATC TTC TCC TCG AGG GAT GAA CAG TOO TGG TTC CGG GAG ACT GAG 1014 

160 Lys lie Phe Ser Ser Arg Asp Glu Gin Ser Trp Phe Arg Glu Thr Glu 

161 230 235 240 
162 

163 ATC TAT AAC ACA GTA TTG CTC AG A CAC GAC AAC ATC CTA GGC TTC ATC 1062 

164 lie Tyr Asn Thr Val Leu Leu Arg His Asp Asn lie Leu Gly Phe lie 

165 245 250 255 260 
166 

167 GCC TCA GAC ATG ACC TCC CGC AAC TCG AGC ACG CAG CTG TGG CTC ATC 1110 

168 Ala Ser Asp Met Thr Ser Arg Asn Ser Ser Thr Gin Leu Trp Leu lie 

169 265 270 275 
170 

171 ACG CAC TAG CAC GAG GAC GGC TCC CTC TAG GAC TTT CTG CAG AOA CAG 1158 

172 Thr His Tyr His Glu His Gly Ser Leu Tyr Asp Phe Leu Gin Arg Gin 

173 280 285 290 
174 

175 ACG CTG GAG CCC CAT CTG GCT CTG AGG CTA GCT GTG TCC GCG GCA TGC 1206 

176 Thr Leu Glu Pro His Leu Ala Leu Arg Leu Ala Val Ser Ala Ala Cys 

177 295 300 305 
178 

17 9 GGC CTG GCG CAC CTG CAC GTG GAG ATC TTC GGT ACA CAG GGC AAA CCA 1254 

180 Gly Leu Ala His Leu His Val Glu lie Phe Gly Thr Gin Gly Lys Pro 

181 310 315 320 
182 

18 3 GCC ATT GCC CAC CGC GAC TTC AAG AGC CGC AAT GTG CTG GTC AAG AGC 1302 

184 Ala lie Ala His Arg Asp Phe Lys Ser Arg Asn Val Leu Val Lys Ser 

185 325 330 335 340 
186 

187 AAC CTG CAG TGT TGC ATC GCC GAC CTG GGC CTG GCT GTG ATG CAC TCA 1350 

188 Asn Leu Gin Cys Cys lie Ala Asp Leu Gly Leu Ala Val Met His Ser 

189 345 350 355 
190 

191 CAG GGC AGC GAT TAG CTG GAC ATC GGC AAC AAC CCG AG A GTG GGC ACC 1398 

192 Gin Gly Ser Asp Tyr Leu Asp lie Gly Asn Asn Pro Arg Val Gly Thr 

193 360 365 370 
194 

195 AAG CGG TAG ATG GCA CCC GAG GTG CTG GAC GAG CAG ATC CGC ACG GAC 1446 

196 Lys Arg Tyr Met Ala Pro Glu Val Leu Asp Glu Gin lie Arg Thr Asp 

197 375 380 385 
198 

199 TGC TTT GAG TCC TAG AAG TGG ACT GAC ATC TGG GCC TTT GGC CTG GTG 14 94 

200 Cys Phe Glu Ser Tyr Lys Trp Thr Asp lie Trp Ala Phe Gly Leu Val 

201 390 395 400 
202 

203 CTG TGG GAG ATT GCC CGC CGG ACC ATC GTG AAT GGC ATC GTG GAG GAC 1542 

204 Leu Trp Glu lie Ala Arg Arg Thr lie Val Asn Gly lie Val Glu Asp 

205 405 410 415 420 
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206 

207 TAT AG A CCA CCC TTC TAT GAT GTG GTG CCC AAT GAC CCC AGC TTT GAG 1590 

208 Tyr Arg Pro Pro Phe Tyr Asp Val Val Pro Asn Asp Pro Ser Phe Glu 

209 425 430 435 
210 

211 GAC ATG AAG AAG GTG GTG TGT GTG GAT CAG CAG ACC CCC ACC ATC CCT 1638 

212 Asp Met Lys Lys Val Val Cys Val Asp Gin Gin Thr Pro Thr lie Pro 

213 440 445 450 
214 

215 AAC CGG CTG GCT GCA GAC CCG GTC CTC TCA GGC CTA GCT CAG ATG ATG 1686 

216 Asn Arg Leu Ala Ala Asp Pro Val Leu Ser Gly Leu Ala Gin Met Met 

217 455 460 465 
218 

219 CGG GAG TGC TGG TAC CCA AAC CCC TCT GCC CGA CTC ACC GCG CTG CGG 17 34 

220 Arg Glu Cys Trp Tyr Pro Asn Pro Ser Ala Arg Leu Thr Ala Leu Arg 

221 470 475 480 
222 

223 ATC AAG AAG AC A CTA CAA AAA ATT AGC AAC AGT CCA GAG AAG CCT AAA 1782 

224 lie Lys Lys Thr Leu Gin Lys lie Ser Asn Ser Pro Glu Lys Pro Lys 

225 485 490 495 500 
226 

227 GTG ATT CAA TAGCCCAGQA GCACCTGATT CCTTTCTGCC TGCAGGGGGC 1831 

228 Val He Gin 
229 

230 TGGGGGGGTG GGGGGCAGTG GATGGTGCCC TATCTGGGTA GAGGTAGTGT GAGTGTGGTG 1891 
231 

2 32 TGTGCTGGGG ATGGGCAGCT GCGCCTGCCT GCTCGGCCCC CAGCCCACCC AGCCAAAAAT 1951 
233 

2 34 ACAGCTGGGC TGAAACCTGA AAAAAAAAAA AAA 1984 
235 

236 (2) INFORMATION FOR SEQ ID NO: 2: 
2 37 (i) SEQUENCE CHARACTERISTICS: 

238 (A) LENGTH: 503 amino acids 

239 (B) TYPE: amino acid 

240 (D) TOPOLOGY: linear 

241 (ii) MOLECULE TYPE: protein 
242 

24 3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

244 

245 Met Thr Leu Gly Ser Pro Arg Lys Gly Leu Leu Met Leu Leu Met Ala 

246 15 10 15 
247 

248 Leu Val Thr Gin Gly Asp Pro Val Lys Pro Ser Arg Gly Pro Leu Val 

249 20 25 30 
250 

251 Thr Cys Thr Cys Glu Ser Pro His Cys Lys Gly Pro Thr Cys Arg Gly 

252 35 40 45 
253 

254 Ala Trp Cys Thr Val Val Leu Val Arg Glu Glu Gly Arg His Pro Gin 

255 50 55 60 
256 

257 Glu His Arg Gly Cys Gly Asn Leu His Arg Glu Leu Cys Arg Gly Arg 

258 65 70 75 80 
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****** PREVIOUSLY ERRORED SEQUENCES - EDITED ***** 



2767 (2) INFORMATION FOR SEQ ID NO: 29: 

2768 (i) SEQUENCE CHARACTERISTICS: 

2769 (A) LENGTH: 6 amino acids 

2770 (B) TYPE: amino acid 

2771 (D) TOPOLOGY: linear 

2772 (ii) MOLECULE TYPE: peptide 
2773 

2774 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
2775 

2776 Gly Thr Lys Arg Tyr Met 

2777 1 5 
2778 
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